Ontology based ranking of documents using Graph Databases: a Big Data Approach
نویسندگان
چکیده
Today recruiters find their suitable human resources by searching in the job related web sites. In the same way, the job seekers also select their suitable jobs. The job seekers post their resumes in the multiple web sites. The HR tools available in the market make the job of recruiters easy by giving them the suitable resumes. Still, there is a good chance that the relevant documents may be missing in the list and unwanted document may exist in the list. This paper proposes a model for extracting resume information from different websites and makes the job of job recruiter easier by finding the suitable resume to fit their needs. Ontology is created with the suitable entities and their relationships for this domain. Each resume is split into four different sections namely – personal, education, skills and work experience. Attribute values are extracted from the resume documents. These values are updated in four different Resource Description Framework (RDF) files for each resume through ontology mapping. Resumes are ranked based on cosine similarity measure and then the ontologies are updated correspondingly. Here Graph database (i.e RDF), a NoSQL data model is used for storing the resumes and SPARQL is used for querying the documents. Experiments are carried out based on the retrieval time of relevant documents to find out the effective RDF model for storing the resumes.
منابع مشابه
An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)
Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملA personalized search using a semantic distance measure in a graph-based ranking model
The goal of search personalization is to tailor search results to individual users by taking into account their profiles, which include their particular interests and preferences. As these latter are multiple and changing over time, personalization becomes effective when the search process takes into account the current user interest. This paper presents a search personalization approach that m...
متن کاملSemantic Clustering of Genomic Documents Using Go Terms as Feature Set
The biological databases generate huge volume of genomics and proteomics data. The sequence information is used by researches to find similarity of genes, proteins and to find other related information. The genomic sequence database consists of large number of attributes as annotations, represented for defining the sequences in Xml format. It is necessary to have proper mechanism to group the d...
متن کاملFinding Important Vocabulary Within Ontology
In current Semantic Web community, some researches have been done on ranking ontologies, while very little is paid to ranking vocabularies within ontology. However, finding important vocabularies within a given ontology will bring benefits to ontology indexing, ontology understanding and even ranking vocabularies from a global view. In this paper, Vocabulary Dependency Graph (VDG) is proposed t...
متن کامل